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1 

Voltage-Gated Potassium 
Channel Genes 



K. George Chandy and George A. Gutman 

2.1.0 Introduction 

2.1 .0.1 Potassium Channels 

Potassium channels play a vital role in the functioning of diverse cell types 
(Rudy, 1988; Hille, 1993). They regulate membrane potential in 
electrically excitable cells such as nerves and muscle, as well as in 
nonexcitable cells such as lymphocytes (Hille, 1993). Although many verte- 
brate K + channel genes have been isolated by molecular cloning (see Gutman 
and Chandy, 1993), and a variety of experimental strategies have defined 
functional domains within the channel proteins (see Miller, 1990, 1992; Jan 
and Jan, 1992), little progress has been made in the biochemical characteriza- 
tion of these proteins. In addition, little effort has been made to define the 
mechanisms that control transcription and functional expression of these 
important molecules. Several excellent reviews have dealt with the structural 
features that underlie particular biophysical and pharmacological properties 
of K + channels (Miller, 1990, 1992; MacKinnon, 1991a; Jan and Jan, 1992; 
Pongs, 1992, 1993), and others discuss the Shaker and related K* channel 
genes in flies (Salkoff et a).. 1992; Pongs, 1993), the mammalian Isk K + 
channel (S wanson et aL, 1 993), and the calcium-activated K + channel (Garcia 
et al., 1 993). This chapter will focus on the mammalian voltage-gated K* (Kv) 
channel gene family. We discuss their amino acid sequence alignments and 
evolutionary relationships, compare their electrophysiological and pharma- 
cological properties, and discuss possible mechanisms that may underlie their 
tissue-specific expression. 

2.1 .0.2 The Extended Kv Channel Gene Family 

Early in 1987, three groups, using genetic techniques in combination with 
molecular strategies, isolated the gene encoded by the Shaker (Sh) locus in 
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Drosophila melanogaster (Kamb et al., 1987; Papazian et al., 1987, Schwarz 
et al., 1988; Pongs ct ah, 1988). Three related fly genes, Shab, Shaw, and 
Shal. were soon isolated, and each of these produced functionally distinct 
channels (Butler etal., 1989, 1990; Wei etal., 1990;Covarnibiasetal., 1991). 
A topology has been proposed for this family of channels (shown in Figure 
1) on the basis of hydropathy plots and structure-function analyses (Miller, 
1990, 1992; Jan and Jan, 1992; Pongs, 1992; Durell and Guy, 1992). The 
channel is thought to have six membrane-spanning segments, termed SI 
through S6, with both the amino- and carboxy-termini located intracellularly. 
The region between the S5 and S6 segments (the "P-region," or "pore") is 
thought to participate in forming the ion conduction pathway, while the S4 
segment forms a major part of the voltage sensor (Miller, 1990; Papazian et 
al, 1991; Jan and Jan, 1992; Pongs, 1992; Durell and Guy, 1992). The loop 
linking SI and S2 has recently been experimentally confirmed to be located 
extracellularly (Chua et al., 1992; Shen et al, 1993). A more detailed descrip- 
tion is provided below. 

Using Shaker probes, Tempel et al. (1988) and Baumann and colleagues 
(1988) isolated the first mammalian homologues of the Shaker gene. In the 
next four years a total of 17 vertebrate genes encoding voltage-gated K + 
channels were isolated from mammals and frogs. The current status of this 
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FIGURE 1 . Schematic diagram of the presumed structure of voltage-gated K + 
channels. The six putative membrane-spanning domains are labeled SI through 
S6, and "P" indicates the P-region, which is thought to form part of the ion 
conducting pore. Three sites of possible posttranslational modification are 
indicated, an N-glycosylation ("CHO") and a tyrosine kinase ("TY-K") site both 
present in mammalian Sha/cer-subfamily channels, and a protein kinase C 
("PKC") site present in all known voltage-gated K + channels (see text). 
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ichwarz continuously growing list is shown in Table 1, which includes the assigned 

aw, and gene names conforming with the recently adopted standardized nomenclature 

distinct , (Chandy et al., 1991). Most of these genes segregate into four clearly defined 

1991) gene subfamilies (Chandy et al., 1991; Gutman and Chandy, 1993; see 

, Figure references in Table 1), each of which (Kvl. 1-1:8, Kv2.1-2.2, Kv3.1-3.4, 

(Miller, Kv4. 1-4.3) is structurally and evolutionary related to one of the four fly 

>2). The genes (Shaker, Shab, Shaw, and Shal). Two additional mammalian genes 

med SI have been isolated (IK8/Kv5.1 and K13/Kv6.1) for which fly homologues 

llulariy. have not yet been identified (Drewe et al., 1992); their inclusion in this 

>ore") is scheme of nomenclature should be treated with caution, as their functionality 

^ the S4 and voltage dependence have yet to be confirmed by expression of the cloned 

>azian et sequences. In addition, two Shaker homologues from Xenopus (XSha 1 , XSha2) 

Che loop have been described and have been incorporated into this standardized no- 

; located menclature. Shaker homologues have also been described in Aplysia (Pfaffinger 

descrip- et al, 1991), in C. elegans (Wei et al., 1991), in leeches (Johansen, 1990), and 

in schistosomes (E. Kim, personal communication). The incorporation of the 

: lleagues . - Aplysia channel (Shen.et al., _1 993) into the standardized nomenclature is 

e. In the currently questionable, as their divergence appears to predate the gene dupli- 

;ated K + cations that gave rise to the multimembered Kv gene subfamilies i n mammals 

s of this and frogs (discussed below). 



2.1 .1 Shaker-Related Gene Subfamily 

2.1.1.1 Protein Sequence Comparisons 

Figure 2 shows the alignment of the amino acid sequences of all known 
Shaker homologues, including those of humans, rats, mice, dogs, cattle, 
frogs, and the mollusc Aplysia, together with the Drosophila Shaker se- 
quence.* An obvious feature of this alignment is the striking sequence 
conservation within certain regions, including most of the hydrophobic core 



* In discussing this and other sequence alignments, we would emphasize several cautionary 
notes: 

1 . Sequences from less than full-length mRNA can obscure authentic start sites (e.g., DRK 1 ), 
or suggest spurious polyadenylation sites (e.g., MBK1). 

2. In the case of protein-coding sequences determined from genomic DNA alone, lacking 
mRNA sequences with which to compare them, the possible presence of unidentified introns 
may obscure the sites of translation initiation or termination. 

3. The presence of unspliced or only partially spliced mRNAs, which appears to be not 
uncommon among K* channels, may mask the presence of introns (e.g., MBK1) and also 
obscure translational start and stop sites (e.g.. Shaker). 

4. Not surprisingly, a variety of unresolved sequencing errors exists in published K* channel 
sequences. While many may be of no major consequence, several cause local shifts in the 
reading frame (e.g., mKv3.3 vs rKShHID, and RCK4 vs RHK1), or alter either translational 
start sites (mKv3.3 vs rKShHID) or termination sites (RK5 vs Shall). 

In our discussion, we have attempted to avoid drawing conclusions from sequence differences 
that are likely to be the result of technical errors, and to rely largely on features that can be 
confirmed by comparison between independently determined sequences of the same gene, or of 
homologous genes between different species. 
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Table 1 a. Nomenclature of Mammalian Voltage-Dependent Potassium Channel Genes' 

Names in use c 



Standard 

names b 

Shaker-Tttettd subfamily I 
Kvl.l (KCNAI) 



Kvl.2 (KCNA2) 



KvI3 (KCNA3) 
Kvl.4 (KCNA4) 
Kvl.5 (KCNAS) 4 

Kvl.6 (KCNA6) 
Kvl.7 (KCNA7) 



Sfcafc-related subfamily 2 

Kv2.1 (KCNB1) 

Kv2.2 (KCNB2) 
S/ww-related subfamily 3 

Kv3.1 (KCNCl) e 

Kv3.2 (KCNC2)* 



Kv3.3 (KCNC3) e 

Kv3.4 (KCNC4) 
SfcaJ-rcIated subfamily 4 
Kv4.1 (KCND1) 
Kv4.2 (KCND2) 

Kv4.3 (KCND3) 
Subfamily 5 

Kv5.1 (KCNF1) 
Subfamily 6 

Kv6.1 (KCNG1) 



Mouse 



Rat 



MBK1 1 

MK1 517 

MK2 5 
MK5 17 



MK3 5 - 11 " 



MK4 17 
mKvI.4 80 



MK2 17 
MK6 56 
MK6 17 
MK4 18 

mShab 19 



NGK2' 0 

mShaw22 22 

mShawU 22 



mKv3.3 24 

mShawl9 22 

mKv3.4 24 

mShal 26 



RCK1 2 

RBK1 7 

RK1 17 - 27 

BK2 6 

RCK5» 

NGK1 10 

RK2 27 

RAK 40 

RH1 72 

RCK3 9 

RGK5' 3 

KV3' 4 

RCK4 9 

RHK1 15 

RK4 17 - 27 

RK8 17 

KWV* 

RK3 27 

RMK2 36 



RCK2 16 

KV2 14 

RK6 17 



Human 



(Chrom.) 



HuKd) 3 - 4 - 41 
HKC-1 71 

HuK(IV) 3 ' 4 - 41 



(12pl3»- 4562JJ0 ) 



DRK1 20 
CDRK 34 

Kv4 2 > 

Raw2 29 

Rshawl2 22 

RKSMIIA 23 

Rawl 29 

rKv3.2b,c 60 

RKSMIID 37 



HuK(M) 3 - 4 

hPCN3 12 

HLK3 31 

HGK5 32 

HuK(II) 344 ' 

hPCN2 ,2 « 

HK1 30 

HuK(VI) 3 - 4 

hPCNl 12 

HK2 30 

HCK1 33 

fHK 73 

HBK2 16 

HuK<V) 34 



hDRKl 67 - 68 



hKv3.1 33 

NGK2-KV4 37 - 38 

HKShinA 37 



Raw3 25 

RK5 27 

Shall 46 

RKShlVB 5435 

IK8 43 

K13 43 



HKShHID 37 

MCV3.3 44 

HKShltIC 28 



(Ipl3.3 31 ) 
(lp21 59 ) 



(llql3-14 37 ) 
(llpl4.l SM0 ) 



(12p 33 - 45 ) 



(19ql3.3 39 - 45 ) 



(11P15 35 - 38 ) 

(11P14.3-15.2 80 ) 

(12 35 - 45 ) 

(19ql3.3-13.4 w ) 



(l9ql3.3-4***«) 
(lp2 1 24 - 28 * 45 ) 
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Table 1a. (Continued) 




See Chandy ct al., Nature 352, 26, 1991; Gutman and Chandy, Senu Neurosci. 5, 101-106, 1993. 
Human locus names assigned by the Human Gene Mapping Workshop (HGMW) are given in paren- 
theses. 

Homologues of Kvl .1 . Kvl .5, Kvl .6 and Kvl .7 have been isolated from hamster, 17 two 5/wter-related 
genes (Xshal and Xsha2) from Xenopus™ homologues of Kvl .2 (BGK5) 69 and Kv 1 .4 (BAK4)« from 
cattle, Kvl.2 (dKvl.2) 70 from dog, and several clones showing similarity with Kvl. 1-1 .4 from rabbit. 42 
Homologues of the Drosophila eag gene have been isolated from mouse (meag), rat (reag), and human 
(heag), representing at least two distinct members of a cyclic nucleotide-binding K+ channel family. 65 - 66 
Genomic and cDNA sequences encoding the IsK/minK channel, a putative single-membrane-spanning 
domain channel, have been isolated from rats, mice and humans; 47 ^ 9 the human homologue has been 
assigned the HGMW name KCNB1 , and the gene has been localized to human chromosome 21 q22. 3S - 45 - 64 
Termed *'KCNA1" in Reference 33. 
Alternately spliced forms are known. 



Table 1b. K + Channel Sequences in GenBank/EMBL Databases 8 





Name b 


Comments 


Access, no. 


Ref. 




mKvl.l 


MBK1 


Y00305 


1 




MK1 


M30439 c . 


5 




rKvl.l 


RCK1 


XI 2589 


2 






RBK1 


M26161 


6,7 




hKvl.l 


HuK(I) 


L02750 


3,4.41 




xKvl.l 


XShal 


M94258 c 


61 




mKvl.2 


MK2 


M30440 c 


5 




rKvl.2 


BK2 


mm 


6 






RCK5 


X16003 


9 






RAK 


M74449 


40 




hKvl.2 


HuK(IV) , 


L02752 


3,4,41 


W.45) 


bKvl.2 


BGK5 e 


L23170 


69 


xKvl.2 


XSha2 


M35664' 


8 




mKvl.3 


MK3 


M30441* 


5,11 




dKvl.2 


dKvl.2 


L19740 


70 




rKvl.3 


RCK3 


X16001 


9 


> 




RGK5 


M30312* 


13 


5-1 5.2 80 ) 




KV3 


M31744 c 


14 


hKvl.3 


hPCN3 
HLK3 


M55515* 
M85217 


12 
31 


M3.4 M ) 




HGK5 


M38217 c 


32 


rKvl.4 


RCK4 


X16002 


9 






RHK1 


M32867 


15 




hKvl.4 


HuK(II) 
hPCN2 


L02751 
M55514 


3,4,41 
12 


28.45) 

\ 




HK1 


M60450 


30 


hKvl.4 




U037322-3 C 


80 


bKvl.4 


BAK4 


X57033 


63 




rKvl.5 


KV1 


M27158 € 


14 




hKvl.5 


hPCNl 


M55513 


12 






HK2 


M60451 


30 






HCK1 


M83254<-< 


33 




mKvl.6 


MKI.6 


M96688 


56 




rKvl.6 


RCK2 


X1762I 


16 






KV2 


M2715^ f 


14 




hKvl.6 


HBK2 


X 17622 


16 



HANDBOOK OF RECEPTORS AND CHANNELS 



Table lb. (Continued) 



Name b 




Access, no. 


Ref. 


mKv2.1 


mShab 


M64228 


1 Q 

19 


rKv2.1 


DRK1 


XI 6476* 


20 


hKv2.1 


hDRKl 


L02840* 


67 


hDRKl 


X68302 


DO 


rKv2.2 


CDRK 


M77482 


3** 


mKv3.1 


NGK2 


Y07521 


lfl 
l\) 


rKv3.l 


KV4 


M6558U 


91 


Raw2 


X6284U 


9Q 
ZV 


hKv3.l 


hKv3.1 






rKv3.2 h 


RKShlHA 


fcAI/IAO 


9^ 


Rawl 


XoZoJy 


9Q 




rKv3.2b 


WCA01 1 


fin 




rKv3.2c 


M59313 


£A 


mKv3.3 


mKv3.3 


X60796-7 C 


\ ft 9 A 


rKv3.3 b 


r» T/'CU.lllPk 

RKSnUID 


MR4910-1 


37 


hKv3.3 


hKv3.3 


LA l!>53^ 


44 


mKv3.4 


mKv3.4 


M8125i LC 


1R 94. 


rKv3.4 


Raw3 


X62841 


Zj 


hKv3.4 


HKShHIC 


M64676 


Oft 

Zo 


mKv4.1 


mShal 


M64226 


20 


rKv4.2 


RK5 


M59980» 


27 




Shall 


c£/09A 

OO^JZU 


46 


rKv5.1 


1K8 


M81783 


43 


rKv6.1 


K13 


M81784 


43 


Other Kv channels 








Shaker 


Drosophila 


Ml /Zl 1 


Shab 


Drosophila 


M32659 


51 


Shal 


Drosophila 


M32660 


51 


Shaw 


Drosophila 


M32661 


51 


KC2.. 


Rabbit 


M8 1350-4'- 


42 


APLK 


Aplysia 


M959I4 


83 


IsK (minK) channels 




S57779 


49 


mlsK 


Mouse heart 


rlsK 


Rat kidney 


M22412 


47 


hlsK 


Human 


M26685 c 


48 



Other K + channels 
Eag 
Slo h 

mSlo 

KAT1 

AKT1 

ROMK1 

IRKl 
GIRK1 
ECOKCH 



Drosophila 

Drosophila 

Drosophila 

Mouse 

A. thaliana 

A. thaliana 

Rat 

Rat 

Rat 

E. coli 



M6U57 

M69053 e 

M96840 

L16912 

M86990 

X62907 

X72341 

X73052 

L25264, U09243 
L12044 



52 
53 
82 
74 
75 
76 
77 
78 

85,86 
79 



cDNA sequences, unless otherwise indicated. 

Prefixes to standard names: m. mouse; r, rat; h, human; b, bovine; x, 
Xenopus. 

Genomic sequence. 

1 bp insertion causes early termination (compare with HK2, HPCN1). 
Partial coding sequence only. 



Table lb. 

f Fullpubl 

1 Pubtishet 

h Alternate 

1 2 bp dele 

Table 1 ref 

1. Tern 

2. Baui 

3. Karr. 
86, ' 

4. Mail 
Soc. 

5. Chai 
Guti 

6. Met 

7. Chri 

8. Ribt 

9. Stub 
Pers 

10. Yok 
(19? 

11. Gris 
Cha 

12. Phil. 
D. r 

13. Dou 
/. In 

14. Swa 
Ant: 
Neu 

15. Tsei 
68. 

16. Gru* 
(19S 

17. Bet* 
A., 

18. Cha 
Ten 

19. Pak 

20. Fret 
{Un 

21. Lur. 
Fol: 

22. Pak 

23. Mc« 
87, 

24. Gh; 
Gui 

25. Sch 
FE, 

26. Pak 
Nat 

27. Rol 
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Table lb. (Continued) 

r Full published sequence not incorporated. 

* Published correction in start site (see Reference 43) not incorporated. 
h Alternately spliced cDNAs. 

» 2 bp deletion results in early termination (compare with Shall). 
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of the protein (the region that includes the six membrane-spanning domains 

51 through S6, and the P or pore-forming region), as well as a substantial 
portion of sequence on the N-terminal side of SI. Equally striking is the 
substantial divergence of sequences at both ends of the protein, as well as 
within two regions that are thought to form extracellular loops (between SI/ 

52 and S3/S4). 

2.1.1.1.1 Divergence of N- and C-Terminal Regions; Repetitive 
Sequences 

The extensive sequence divergence at both ends of these proteins suggests 
either that these regions have very different functions in the different channel 
isoforms, or that whatever functions they do have depend relatively little on 
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their structure (since a region that is completely functionless would be 
expected to be progressively lost over evolutionary time). A clue to the nature 
of such functions may be provided by the fact that the Kvl.4 proteins have 
the longest N-terminal region of the mammalian Shaker subfamily, and also 
contain some strikingly repetitive sequences (e.g., runs of 1 1/13 alanines, 14/ 
15 glutamines, as well as shorter runs of glycines, histidines and lysines); the 
presence of glutamine repeats in the N-terminal portion of Shaker has already 
been noted (Schwarz et al„ 1988). 

Simple repetitive sequences are widely dispersed throughout the ge- 
nomes of eukaryotes, recent highly publicized examples including the tri- 
nucleotide repeats associated with mutations in the gene responsible for 
Huntington's disease (Huntington's Disease Collaborative Research Group, 
1993), and the dinucleotide repeats reported to be associated with a predis- 
position to colorectal cancer (Aaltonen et aU 1993; Thibodeau et al., 1993). 
One mechanism by which simple repetitive sequences may be expanded is 
slipped-strand mispairing. It has been suggested that in nucleotide sequences 
constrained only by the requirement for their presence (and not for a particu- 
lar structure) one might expect a progressive replacement by simple repeats 
(Le Vinson and Gutman, 1987), and the presence of such simple repeats in 
Shaker and Shaker- related genes might be the result of such a process. The 
required conditions could be fulfilled if, for example, the nonconserved 
region were required only to provide a suitable spacer arm ("chain") for the 
N-terminal inactivating "ball" known to be responsible for fast inactivation 
in Shaker (Jan and Jan, 1992). Kvl.4, in fact, is the only mammalian Shaker 
homologue that shows a Shaker~\\ke rapid inactivation, and it also contains 
the longest N-terminal region as well as the most striking amino acid repeats 
within this family; these repeats begin immediately after a stretch of charged 
residues that are thought to form the fast-inactivation ball of this channel 
(indicated in Figure 2; see also Section 2.1.5). While other members of this 
family are delayed rectifiers and do not exhibit rapid "ball and chain" type 
inactivation, analogous "spacer" regions might be required for structures 
involved in subunit assembly or other functions, and the presence of simple 
amino acid repeats might be associated with such spacers. 

.1.1.2 Posttranslational Modification 

Three potential sites of posttranslational modification are indicated in Figure 
2 (as well as Figure 1). First, an N-glycosylation site (Marshall, 1972) is 
present in the extracellular loop between SI and S2 in all the mammalian 
.S/iater-related sequences shown in Figure 2 other than Kvl.6; the absence of 
a glycosylation site in this region is shared by both mouse and human Kvl.6. 
Two such sites are evident in the corresponding region of Aplysia and Shaker 
K* channels. The presence of such a site in almost all of this subfamily of 
channels is all the more striking given the considerable divergence in both the 
sequence and length of the S1-S2 loop. Using an in vitro translation system, 
Shen et al. (1993) demonstrated glycosylation at this site in the Aplysia K + 
channel, N-glycosylation of the Shaker channel has also been shown 
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(Rosenberg and East, 1992; Santacruz-Toloza et al., 1994); it has not yet been 
determined, however, whether this site is glycosylated in native channels. 
Other N-gly cosy lation consensus sequences exist in these and other channels, 
including several in the C-terminal region, but as these are thought to be 
located intracellularly they are unlikely to be utilized and we do not indicate 
. them in any of our alignments. 

Second, all Stater-related proteins have a potential tyrosine kinase site 
in the N-terminal region (RPSFDAILY), indicated in Figure 2; while its 
conservation suggests it is functional, this site has not been experimentally 
shown to be utilized, and deletion of this region from the mKvl .3 channel had 
no apparent affect on channel function (Aiyar et al., 1993a). Third, all of the 
Shaker- subfamily proteins show one or two protein kinase C (PKC) consen- 
sus sites (Ser/Thr-X-Arg/Lys) in the cytoplasmic loop between S4 and S5 
(Kemp and Pearson, 1990), a region that has been modeled to form part of the 
internal surface of the ion conduction pathway (Isacoff et al., 1991; Jan and 
Jan, 1992; Durell and Guy, 1992). In fact, a PKC site in this region is present 
in all known voltage-gated K + channels (further discussed below), and there 
is evidence that phosphorylation of this site can affect channel function. 
Phorbol esters, which activate PKC, suppress Kvl.3 currents in oocytes and 
in Jurkat T cells, probably by phosphorylating these sites, and the effect is 
reversibly inhibited by PKC inhibitors (Attali et al, 1992a; Payet and Dupuis, 
1992; Aiyar et al., 1993b). Rat Kvl.3, translated in vitro, has been reported 
to be phosphorylated by PKC, presumably at one or both of these sites (Cai 
and Douglass, 1993). Similar results have been reported for the rKv3. 1 (Critz 
et al, 1993), hKvl.4 (Fahrig et al., 1992), rKv4.2 (Blair et al, 1992), and 
Shaker channels (Moran et al., 1991), and removal of both PKC sites by site- 
directed mutation (K — » D and R — > D) in the Shaker protein produced 
nonfunctional channels* (Isacoff et al., 1992). 

2.1.1.1.3 Species Conservation 

The mammalian S/zater-related genes generally show a high level of inter- 
species conservation. For example, comparison of the Kvl.2 protein se- 
quences of human, rat, mouse, dog, and cattle reveals greater than 98% 
identity over the entire protein (Figure 3A); even the more distantly related 
Xenopus sequence is greater than 91% identical to the mouse. This is not 
universally true, however, and one exception is illustrated in Figure 3B, 
which shows that rat and human Kvl.5 are only 87% identical overall 
(ignoring one 1 1-residue size difference), and less than 64% identical in the 
amino-terminal 100 residues. The more typical Kvl.2, on the other hand, 
shows 99.4% overall identity between its human and rodent homologues. 
These apparent differences in rates of divergence are not the consequence of 
sequencing artifacts, since the nucleotide sequences of the five human Kvl.5 
cDNAs that have been independently isolated are identical in this region 
(Ramaswami et al., 1990; Philipson et al, 1991 ; Tamkun et al., 1991 ; Curran 
et al., 1992; Fedida et al., 1993), as are the sequences of three independent rat 
Kvl.5 clones (Swanson et al., 1990). 



. not yet been 
ve channels, 
her channels, 
nought to be 
» not indicate 

ie kinase site 
2; while its 
perimentally 
l channel had 
ird, all of the 
KC) consen- 
n S4 and S5 
-m part of the 
991; Jan and 
;on is present 
vv), and there 
nel function. 
• oocytes and 
the effect is 
l and Dupuis, 
>een reported 
jse sites (Cai 
Kv3.1 (Critz 
, 1992), and 
sites by site- 
>in produced 



:vel of inter- 
I protein se- 
er than 98% 
antly related 
. This is not 
i Figure 3B, 
itical overall 
entical in the 
5 other hand, 
homologues. 
nsequence of 
luman Kvl.5 
n this region 
l991;Curran 
lependent rat 



Voltage-Gated K + Channel Genes 



3 

o 

>- 
>- 



id 
a. 

-8 



2 
_§ 

3C 
O 

a. 

3 

ut 



1 

I 



u. a 

Ii 1 



IA to /-s-x — v\ a l/> <n 

CCOX COM IOX COX ZCDX CO K 



ojrvrMcvrvjrvi 



>>>>>> >>>>>> 



HI II 
rvcMivrurutv 



> > >> >> >>>^>> 



ruf\ir\jrvryr\» 



CD -= 

o u * 

»/> c .E 

m d) c 

3 d c 

O CD *3 

E 
o 



u cu 

03 -C 



DO £ 
O .E 

- £ 



CD 
U 
C 
CD 
DO 



8*1 

C 



Lft _ 

_ .E ^ 

CL CD $ 



2 3 S 

-D- CD 
-Q CD 
O vft ro 

"° S 3 

^ O v> 

5 "do 2 

-C c CD 

<u .2 ■£ <N 

^ U DO CD 
£ v2 £ DO 

2 | « c 
o S *o £ 

c - * S 

c nj o.y 

CD C 
- D '** 
CD DO CD 



CO 



c 

2 

DO 
CU 



c ~ ■= 

CD CD O 
3 CD E 
£T DO o 

y do c m 

^ 3 S 

o _c o 

E -£ c c 

< do " ^ 

< I 2? 

. « ° 2 
co _2 t± =3 

UJ = g a « 
3 ■ C O 

O ^ .5P do 

a: £ T3 s 



15 



Voltage-Gated K* Channel Genes 



17 



< APLK 

1 Shaker 

FIGURE 4. Proposed phylogenetic tree of Shaker-related K + channel genes, 
based on parsimony analysis of nucleotide sequence alignments using the 
program PAUP (Swofford, 1 993). The placement of mKv! .7 is poorly supported 
(see text). 

Two additional Shake r-xt\ditd mammalian genes are known to exist, but 
sequences for these genes either exist only as short published fragments 
[mouse and rat Kvl.7 (Betsholtz et al., 1990)] or have not yet been published 
[mouse Kvl .8 (B. Tempel, personal communication); mouse KvL7, K. Kalman 
et al., unpublished data]. 

2.1.1.1,4 SAafcer-Subfamily Phylogenetic Tree 

A proposed phylogeny for the Shaker- related genes is shown in Figure 4. 
While our earlier analyses placed the Xenopus Xsha2 gene outside the 
mammalian group (Strong et al., 1993), the current larger dataset places 
each of the two Xenopus genes within one of the mammalian groups of 
homologues, Xshal with Kvl.l and Xsha2 with Kvl. 2. This larger dataset 
also positions the Kv 1 .2 group differently than previously, making it a sister 
group of the Kvl.l cluster. The remaining features of this tree are the same 
as we have already proposed (Strong et al., 1993), including our inability 
to determine the position the mouse Kvl.7 sequence in this tree with any 
degree of confidence. It should be noted that the terminal patterns of 
branching in the Kvl. 2 and Kvl. 4 clusters, the two that include bovine 
homologues as well as rodent and human, are inconsistent with each other. 
Neither of these patterns is compellingly supported by bootstrap analysis, 
and this inconsistency reflects the current uncertainty regarding the true 
phylogenetic relationships between many mammalian orders (Novacek, 
1992; Graur, 1993). 

If this phylogeny is correct, then the duplications that gave rise to the 
mammalian Shaker- related subfamily of genes must have occurred before the 
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divergence of mammals and amphibians, some ^^Z^ 
for discussion of times of divergence see N«. 987 JJ-J^ 

maZ group implies that the vertebrate duplicaaons occurred aft er the 
divergence of moSuscs as well as insects from the vertebrate ^(jMO 

mined. 



2.1.1.2 Genomic Organization 



The Shaker eene in Drosophila contains multiple exons spread over 120 kb 
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,990; Wymore et al., 1 994; B. Tempel, persona communication ) 

g ene is I o"^^^ 

^^Jt^^^^^^ regi ° nS W ° Uld 
2t££E* Tg^oHf diverse channel proteins through alternate 
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t potential sites within known coding regions (see cauUO* ; J*£j£ 
ouence interpretation in footnote). ThcXenopus Kvl.l and Kv .2 genes also 

their coding regions (Ribera, 1990; 
suiting that this genomic organization may be common to all vertebrates^ 
ThfevoSonary significance of this difference in the ^ 

vertebrate channel genes and the mechan.sms by wh.ch tms fanuly of mtronless 

genes have evolved remain obscure. 

2.1.1.2.1 5 -Noncoding Introns in Kvl.l and Kvl.2; 
Posttranscriptional Regulation? ■ h 

Kvl l-Kvl.3 genes express several large transcripts (8-9 5 kb) along .with 
Several smaHef ones (Tempel et al., 1988; Stuhmer et al., 1989; McKmnon, 
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FIGURE 5. (A) Nucleotide sequence alignment of a portion of the genomic 5' untranslated 
sequence (5'-UTS) of mouse mKvl .1 (from position -886 to -51 4 relative to the beginning 
of the MK1 protein-coding region), with two distinct mKvl .1 cDNAs (MBK1a and b). The 
borders of an intervening sequence present both in the genomic sequence and in MBK1 a 
are indicated by brackets. Three potential translation start sites (PNNATG, where P 
indicates either A or G, and N indicates any nucleotide) are boxed, and the lengths of their 
downstream open reading frames are indicated above. Two sites that display the preferred 
initiation sequence, ANNATG, are indicated by asterisks. (B) Alignment of a portion of the 
genomic 5'-UTS of mouse Kv1 .2 (from position -1111 to -902) with that of a rat Kv1 .2 
cDNA. The 3' end of a putative intervening sequence is indicated by a bracket, and a 
potential translational start site is indicated as in A. (C) Alignment of another region of the 
genomic 5'-UTS of mouse Kv1.2 (from position -607 to -136) with that of a rat Kv1.2 
cDNA, showing the presence of an intron whose ends are indicated by brackets. Six 
potential translation start sites are indicated as in A and B. 
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1989- Douglass et at., 1990; Spencer et al., 1993). In the case of mKvl.l, 
alternative splicing of a short intron within its 5'-NCR has been shown to 
yield both spliced and unspliced polyadenylated RNA (Tempel et al., 1988). 
As seen in Figure 5A, this 5' intron (position -886 to -514) contains three 
potential initiation codons (consensus sequence PNNATG, where P indi- 
cates either A or G, and N is any nucleotide), each generating an open 
reading frame whose length is indicated above the ATG codon; two of these 
(indicated by asterisks) show the preferred consensus sequence ANNATG. 
According to the scanning model for translation initiation (Kozak, 1991). 
these upstream ATGs could inhibit translation of the Kvl.l protein by 
delaying the arrival of scanning ribosomes at the authentic start site; Kv 1 . 1 
protein might therefore be inefficiently produced from transcripts which 
contain this intron. ' 

Comparison of the mKv 1 .2 genomic sequence with that of rKv 1 .2 cDNA 
also reveals the presence of at least two introns in the 5'-NCR (Figure 5B,C). 
We have identified only the splice acceptor site of the upstream intron. at 
position -920 (Figure 5B), and this putative intron contains at least one ATG 
initiation codon. The downstream intron (-607 to -136) contains 6 possible 
translational start sites producing open reading frames ranging from 1 8 to 1 56 
bp (Figure 5C), two of which match the preferred Kozak consensus sequence. 

The existence of 5' introns containing potential initiation codons raises 
the intriguing possibility of post-transcriptional regulation of K* channel 
expression by alternative splicing in the 5'-UTS. Kozak (1991) has reviewed 
examples of messenger RNAs of differing translational efficiency being 
produced by alternative sites of transcription initiation, and of the demon- 
strated deleterious effects of upstream ATG codons on translational effi- 
ciency. Her cautionary notes regarding the difficulty of unambiguously iden- 
tifying the structure of functional mRNAs are particularly relevant to our 
discussion; in the case of Kvl . 1 , for example, neither end of the transcription 
unit has been identified, and the intron-containing form of mRNA has not 
been demonstrated to be functional. Nevertheless, mRNAs containing this 
intron appear to constitute a substantial minority of Kvl.l messenger RNA 
in mouse brain (Tempel et al., 1988), and alternative splicing of such introns 
could potentially play a role in both the tissue specificity of expression of Kv 
proteins and in their up- and down-regulation. 

211.2.2 Mapping Kvl.4 and Kvl.5 mRNAs 

The transcripts of Kvl.4 and Kvl.5 are generally shorter (2.4-4.5 kb) than 
those of Kvl. 1-Kv 1.3 (Tamkun et al., 1991; Swanson et al., 1990; Philipson 
et al 1990). A schematic diagram of the mKvl.4 gene and its transcripts is 
shown in Figure 6. mKvl.4 appears to have a single, large intron (3.4 kb) in 
its 5'-NCR (Wymore et al., 1994). Like most other members of the Shaker- 
related family, however, its protein coding sequence, as well as an additional 
2 kb of 5'- and 3'-UTS, is contiguous in the genome. Although neither the 
transcription initiation site nor the 3' polyadenylation site of this mRNA has 
been definitively identified, they are unlikely to be far from the limits indi- 
cated in the figure, defined by the 5' end of hPCN2 and the 3' end of RHK1 
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FIGURE 6. Schematic diagram (above) of Kv1.4 transcripts from human 
(hPCN2), cattle (BAK4), rat (RCK4 and RHKI), and genomic DNA from mouse 
Kv1 4 The shaded box encloses the protein coding region. Within the 3 -UTS 
are indicated the positions of potential polyadenylation signals (tall arrow- 
heads) and ATTTA motifs (short arrowheads) implicated in destab.l.zation of 
mRNA (see text). The aligned nucleotide sequences for the boxed region i m the 
3'-UTS are shown below, with the polyadenylation signals and ATTA se- 
quences indicated. The ruler above indicates nucleotides (in kb) beginning w.th 
the 5' end of the hPCN2 transcript. 

A total of more than 4.4 kb of the Kvl.4 transcript is thus accounted for in 
the mouse genomic sequence, leaving no more than about 100 bp possibly 
remaining at both ends of the longest message. 

Unusual features of the 3'-UTS of Kvl .4 suggest the possibility of this 
region playing a role in posttranscriptional regulation of expression. The rat, 
mouse, and bovine Kvl.4 genes have at least three polyadenylation recogni- 
tion signals in the 3'-NCR. which are indicated in Figure 6. Differential 
utilization of these signals may be partially responsible for the generation of 
the three Kvl.4 mRNAs (2.4, 3.5, and 4.5 kb), although omission of a large 
portion of the 5'-NCR (through an alternative transcriptional start site, or 
splicing of an unidentified intron) would also need to be invoked toaccount 
for a functional message as small as 2.4 kb. Many conserved ATTTA se- 
quences are also present in the 3'-NCRs of the rat, mouse, and bovine Kvl .4 
genes (also shown in Figure 6), motifs that have been reported to reduce 
mRNA stability when present in the 3'-NCRs of cytokine genes (Shaw and 
Kamen 1986). If the presence of these sequences has a destabilizing ettect 
on Kvl.4 transcripts, then one might expect that the shortest Kvl .4 mRNAs, 
those containing the fewest ATTTA sequences, would be more stable than 
longer transcripts and may therefore be the most efficiently translated Such 
AT rich sequences have also been reported to interfere with ^»ti°n 
independently of their effects on message stability (Kruys et al.. 1989). It is 
worth noting that the shortest Kvl .4 transcripts, 2.4 kb in length, can have no 
more than 400 nt of total UTS; therefore, if they have intact protein coding 
regions, they must lack all the ATTTA motifs indicated in Figure 6. 

The 5'-NCR of rKvl.5 as well as its protein coding region appear to be 
intronless, and its transcription start site has been reported to be located 774 
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bp upstream of the initiating ATG (Mori et al., 1993). A cAMP response 
element located in the 5'-NCR at position +636 (relative to the transcription 
start site) confers cAMP responsiveness and binds CREB and CREM DNA- 
binding proteins (Mori et al., 1993). Recently, Shelton et al. (1993) have used 
the (J-globin locus-control region (LCR) to direct transcription from the 
hKvl .5 promoter region, located just upstream of the coding region, resulting 
in the expression of hKvl. 5 currents in mouse erythroleukemic cells. 

2.1.1.2.3 Chromosomal Locations 

The Kvl.l, KvL5, and Kvl.6 genes have been mapped to mouse chromo- 
some 6 (Lock et al., 1994) near the opisthotonus and deafwaddler loci, and 
have been recovered together on a single yeast artificial chromosome (YAC) 
about 275 kb in length (Migeon et al., 1992). No recombination was detected 
between these three genes in 1 13 backcrosses studied, confirming their close 
proximity (Lock et al., 1994). Human Kvl.l has been localized to human 
1 2p 1 3, a region homologous to mouse chromosome 6 (Grissmer et al., 1992; 
Wymore et al., 1994); hKvl.5 and hKvl .6 would therefore also be expected 
to map to this region. In fact, hKvl. 5 has been mapped to human 12pl3 
(Curran et al., 1992; Attali et al., 1993). Mutations in hKvl.l, 1.5, or 1.6 
might therefore result in human diseases linked to human 12pl3, possibly 
resembling opisthotonus or deaf waddler, although none have yet been 
described. Trisomy of the 12p region is associated with an epileptiform 
disorder (Guerrini et al., 1990), and it is tempting to implicate one of these 
three K + channel genes in this disorder. One other Shaker-subfamily gene has 
been reported to be on human chromosome 12, namely Kvl.2 (Grissmer et 
al., 1990); however, its mouse homologue maps to mouse chromosome 3, 
which is homologous to human lp (Lock et al., 1994), and this conflict needs 
to be resolved. 

Human Kvl.3 has been mapped to human chromosome Ipl3/lp21 
(Table 1; Attali et al., 1992b); its mouse homologue mKvl.3 as well as 
mKv 1 .8 have also been mapped to mouse chromosome 3 (Lock et al., 1 994). 
Kvl.7 lies on human 19ql3.3 (K. Kalman et al., unpublished), and on mouse 
chromosome 7 in close proximity to mKv3.3 (K. Kalman et al., unpublished). 
The recessive mutant mouse gene quivering is also located in this region. 

The defective gene for some forms of the long QT (LQT) syndrome, an 
autosomal dominant cardiac disease, lies within 10 cM of H-ray at human 
chromosome llpl5.5 (Keating et al., 1991; Wymore et al, 1994). The QT 
electrocardiogram interval, which is abnormally long in this disease, is a 
measure of cardiac repolarization. Since K + channel opening during the action 
potential is responsible for cardiac repolarization, a defect in a cardiac Kv gene 
such as Kvl.4 could lead to the prolongation of the QT interval, and conse- 
quently to the LQT syndrome. In the mouse, Kv 1 .4 is located close to the gene 
encoding FSHB (follicle stimulating hormone B) on chromosome 2 (Wymore 
et al., 1994), and it occupies a homologous position at human chromosome 
1 lpl4.3, approximately 25 Mb from U-ros (Gessler et al., 1993; Wymore et al., 
1994), Thus, Kvl.4 appears to be too far from H-ras to be considered a 
candidate gene at least for this form of the LQT syndrome. However, the anx 
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gene that causes anorexia, body tremors, and uncoordinated gait in mice also 
maps to mouse chromosome 2 in the same region as mKvl.4 (Maltais et al, 
1984). 

2.1 .2 Shab-, Shaw-, and 5/ia/-Subfamily Genes 

2.1 .2.1 5/iab-Subfamily 

Only two mammalian S/wfc-related isoforms are known, Kv2.1 and Kv2.2, 
represented by five cDNA sequences, one from mouse and two each from rat 
and human (Table 1). An amino acid sequence alignment of the rat homo- 
logues of Kv2.1 and Kv2.2, together with the fly Shab gene, is shown in 
Figure 7. Some of the features discussed for the Shaker alignment hold for 
this comparison as well, namely the striking conservation of the hydrophobic 
core of the proteins and the presence of a PKC site in the S4-S5 intracellular 
loop. However, the S1/S2 loop, which in the Shaker channels is highly 
variable in length and in sequence, is represented by a short conserved 
segment in the S/iafc-related proteins; while the Drosophila Shab protein has 
an N-glycosylation site in this segment (as do most Shaker-related channels), 
this site is absent from both of its mammalian homologues. The S3-S4 
extracellular loop is also shorter and more highly conserved in Stafc-subfam- 
ily genes, and this loop contains an N-glycosylation site in all three proteins. 
A region of 200 N-terminal residues adjacent to SI is very highly conserved 
between all three proteins, while the Shab protein has about 240 additional 
residues at its N-terminus, marked by the presence of some strikingly repeti- 
tive segments (e.g., poly-Glu, poly-GlyAla). On the other hand, the long C- 
terminal region beginning about 100 residues past the end of S6 (the longest 
among all mammalian Kv proteins) shows very little sequence conservation, 
a feature similar to the Shaker comparisons. It remains possible, however, 
that alternate splicing at the 3' end of the coding regions of the Shab genes 
may contribute to this diversity, as it does for Shaw- and 5/ia/-related chan- 
nels (see below), since nothing is yet known about the intron/exon structure 
of the S/wb-subfamily genes. Kv2.1 and Kv2.2 are located on mouse chromo- 
somes 2 (some distance from mKv 1 .4) and 1 , respectively (Lock et al., 1 994). 

2.1.2.2 S/iaw-Subfamily 

2.1.2.2.1 Amino Acid Sequence Alignment 

Four mammalian Shaw homologues have been described (Kv3.1-Kv3.4; 
Table 1), and an alignment of four representative protein sequences from rats 
together with the Shaw sequence is shown in Figure 8. The hydrophobic core 
is highly conserved, as well as a ~70-residue region within the N-terminal 
region, but here separated from the hydrophobic core by a highly variable 
segment of 50-70 residues. Unlike the mammalian Shaker homologues that 
have seven positive charges as part of the repeated motif in S4 (a positively 
charged residue at every third position), the Shaw channel has only four 
positive charges in the region, which has been aligned under the mammalian 
S4, lacking the first two arginines through one deletion and one substitution. 
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However, an additional arginine is present six residues downstream, which is 
in phase with the last arginine in the repeated motif. Given the three-residue 
deletion in Shaw relative to its mammalian homologues, its S4 region should 
probably be extended to this arginine (as indicated by the bracket below the 
alignment), thus bringing to five the total number of positive charges in S4, 
which might contribute to voltage sensing (discussed further below). 

2.1.2.2.2 Posttranslational Modification 

Two N-glycosylation sites are present in the poorly conserved S1/S2 loop of 
all of these S/zaw-related proteins, and the universal PKC site is present 
between S4 and S5. There is a highly variable C-terminal region, although 
alternate mRNA splicing (discussed below) is known to contribute to this 
diversity. Regions containing short amino acid repeats are evident, particu- 
larly in the N-terminal region of Kv3.2 and in the C-terminal insertion just 
outside the conserved core in Kv3.3. 



2.1.2.2.3 Introns and Alternative Splicing 

The transcripts of all four Shaw genes are heterogeneous and include large 
forms, ranging from 4.5-13 kb, and none has yet been mapped. While 
complete genomic sequence is not available for any Sftaw-related protein, one 
intron is known to be present at the 5' margin of the SI transmembrane 
segment in Kv3.1, 3.3, and 3.4 (Ghanshani et al., 1992; Luneau et al., 
1991 a,b). In addition, alternative splicing is known to contribute to the 
diversity of the C-termini of Kv3.1 and Kv3.2; the Kv3.1b channel differs 
from its alternately spliced counterpart, Kv3. 1 a, in that the last 1 0 amino acids 
of Kv3 . 1 a are replaced by an unrelated sequence of 84 amino acids in Kv3. 1 b. 
In the case of Kv3.2, alternative splicing at their 3' ends can yield four distinct 
KV3.2 mRNAs, as shown schematically in Figure 9. Two of these transcripts 
(RAWl/rKShIIIA2 and rKShlllA) share a common 3' segment, indicated by 
cross-hatching; this shared region does not contribute to the translated prod- 
uct of the RAWl/rKShIIIA2 transcript, however, due to the presence of a 
termination codon (indicated by a heavy bar) in the shorter exon, which is 
absent from the rKShlllA transcript (Luneau et al., 1991b; Rettig et al., 1992). 
While these two exons are shown as being separated by an intervening 
sequence, the existence of such an intron is unproven, since the genomic 
sequence is not known; the two exons might be contiguous in the genome, as 
they are in the RAWl/rKShIIIA2 transcript, and the rKShlllA transcript 
would then result from the use of a splice acceptor site within this exon. The 
same caution holds true for many other Kv genes for which multiple tran- 
scripts are known but no genomic sequence is available. 

2.1.2.2.4 Sftaw-Subfamily Phylogenetic Tree 

Figure 10 shows a proposed phylogenetic tree of all known S/ww-related 
genes, resolving the ambiguities in our earlier analyses, which resulted from 
a paucity of data (Strong et al., 1993). This phylogeny implies that three 
separate gene duplications of a S/ww-related ancestor gave rise sequentially 
to the four known mammalian Shaw homologues. No nonmammalian Shaw- 
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FIGURE 9. Four alternatively spliced variants at the 3' end of rat Kv3.2 
mRNAs, showing the hypothetical splicing patterns. Four distinct exon-contain- 
ing regions are indicated by different fill patterns, and the positions of transla- 
tion termination codons are indicated by solid vertical bars. The scale bar 
indicates a length of 60 nucleotides. 
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FIGURE 10. Proposed phylogenetic tree of Sha w-related K + channel genes, 
based on parsimony analysis of nucleotide sequence alignments using the 
program PAUP (Swofford, 1993). 

related sequences are yet available to determine at what period in the verte- 
brate radiation these duplications occurred. 

2.1.2.2.5 Chromosomal Locations 

The four S/za w-related genes (Kv3.1, Kv3.2, Kv3.3, Kv3.4) have been re- 
ported to be located on human chromosomes 11, 12/19, 19, and 1, respec- 
tively (see references in Table 1). Kv3.1 has been sublocalized to human 
chromosome llpl4.3-15.2 (Reid et al., 1993; Wymore et al., 1994); it 
appears to be about >25 Mb away from H-raj, and therefore is unlikely to be 
a candidate gene for the H-ras-linked form of the LQT syndrome (Wymore 
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et al., 1994). In the mouse, Kv3.1 is within 1 cM of myoDl on chromosome 
7 (Wymore et al., 1994), a region that is homologous to human 1 lpl5. 1/15.2 
(Reid et al., 1993; Wymore et al, 1994). While Kv3.2 maps to mouse 
chromosome 10 in a region homologous to human 12q (Lock et al., 1994), 
conflicting reports have placed it on human 12 (Cuiran et al., 1 992; McPherson 
et al., 1991) and on 19ql3.3-13.4 (Haas et al., 1993). Kv3.3 lies close to 
Kvl/7 both in the mouse (on chromosome 7) and in humans (on human r 
I9ql3.3; K. Kalman et al., unpublished data). Kv3.4 has been mapped to 
mouse chromosome 3 and the homologous region of human lp2l (Rudy et 
al., 1991a; Ghanshani et al., 1992; B. Tempel, personal communication). 

2.1*2.3 Sfca/Subfamily 

An alignment of proteins representing two mammalian isoforms of Shal- 
related channels is shown in Figure 11, together with that of the Drosophila 
Shal protein; one additional S/w/-related mammalian gene, Kv4.3, is known 
to exist, but its sequence has not yet been published (Rudy et al., 1991b). The 
hydrophobic core is highly conserved, and the universal PKC site is present 
between S4 and S5. A potential tyrosine kinase site is present in the N- 
terminal region of Kv4.2, but not Kv4.1, and the only potential external N- 
glycosylation site is in the extracellular loop linking S5 to the P-region of 
Kv4.1 and Shal (but not Kv4.2). Interestingly, sodium channel proteins are 
known to be heavily N-glycosylated, at a position thought to be between S5 
and the pore region (Miller et al., 1983; James and Agnew, 1987; Ukomadu 
et al, 1992). The S4 segment of the MaZ-subfamily genes contains only six 
positive charges that are part of the repeated motif, in contrast to the 7 positive 
charges present in ShaJter-subfamily genes. Like the Shab subfamily (but 
unlike the Shaker and Shaw famiiies) r the N- terminal region of Shal subfam- 
ily genes is almost perfectly conserved in its length, and substantially con- 
served in its sequence. While the C-terminal regions show extensive se- 
quence divergence, nothing is known of possible alternative 3' splicing within 
these genes. Kv4.1 is located on the mouse X chromosome (Lock et al., 
1994). 

2.1 .3 The Extended Voltage-Gated K + Channel Gene 
Family: Amino Acid Sequence Comparisons 

Figure 12 shows an alignment of the amino acid sequences of 16 mammalian 
K + channel genes, one representative of every known mammalian Kv gene, 
together with the four known fly homologues. There is clearly substantial 
sequence conservation in a large portion of the hydrophobic core of these, 
proteins, as well as a more limited region in the N-terminal portion, about 1 00 
residues away from SI; these relatively conserved regions are indicated by 
brackets below the alignment. Within these regions, comprising some 262 
residues altogether, 44 positions show complete identity, among all 20 se- 
quences, while at an additional 65 positions only conservative substitutions 
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are seen. In current models of K* channels, the six membrane-spanning 
domains define the overall architecture of the channels and determine its 
gating properties, and the P-region participates in forming the ion-conducting 
pore (Durell and Guy, 1992); substantial conservation of these sequences is 
therefore not difficult to understand. 



2.1.3.1 Amino and Carboxy-Terrninal Regions 

While size differences within the hydrophobic core of Kv channels is limited 
to the regions between the putative membrane-spanning segments, there is 
considerable diversity in the length of their C- and N-terminal regions, as 
illustrated schematically in Figure 13. The Mater-related channels all have 
longer N-terminal regions than C-terminal; the Sfazb-related proteins are the 
longest overall, and its mammalian homologues have the longest known C- 
termini, while the Shab protein has the longest N-terminus among all known 
Kv channels. Alternative RNA splicing can contribute substantially to diver- 
sity in the lengths of these regions, the C-terminus of Kv3.3 (for example) 
varying more than twofold in length in alternate forms. 

While this diversity of structure suggests a corresponding diversity of 
function, relatively little is known about the specific roles of the cytoplasmic 
"tails" of these proteins. In some channels (e.g., Shaker and Kvl.4), an N- 
terminal peptide has been shown to participate in rapid inactivation, although 

S1-S6 



rKvl.l 
rKvl.2 
rKvl.3 
rKvl.4 
rKvl.5 
rKvl.6 
Shaker 

rKv2.1 
rKv2.2 
Shab 

rKv3.I 
rKv3.2 
rKv3.3 
rKv3.4 
Shaw 

rKv4.1 
rKv4.2 
Shal 

rKv5. 1 
rKv6.i 



lOOaa 

FIGURE 13. Schematic diagram showing the relative lengths of the N-termi- 
nal and C-terminal cytoplasmic "tails" of Kv proteins. The boxed region repre- 
sents the hydrophobic core of the proteins from SI through S6, and size 
differences within this region have been ignored. Those segments known to 
vary in length through alternative mRNA splicing are indicated by open boxes. 
The scale bar indicates a length of 100 residues. 
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there is relatively little sequence conservation of such peptides. The strongly 
conserved region in the N-terminal cytoplasmic region of these proteins (the 
region including the YFFDR motif) could be important in establishing the 
specificity of subunit interactions in its native tetrameric form (Li et al., 1992; 
Shen et al., 1993). Another (short) conserved motif is represented by the two 
adjacent cysteines present in the N-terminal region of all Kv genes other than 
the Shaker subfamily (IYLESCCQARY in rKv2.1, at position 382 in Figure 
12), in a region with no homologue in the Shaker subfamily; a conserved 
cysteine is also located about 25 residues upstream in this same group of 
proteins (HMMEEMCALS in rKv2.1, at position 361 in Figure 12), although 
no function has been ascribed to any of these cysteines. Two additional 
conserved cysteines, one in S2 and the other in S6 (see Figure 12), when 
mutated in Kv2.1, did not affect channel function (Zuhlke et al., 1993). 

2.1.3.2 The P-Region 

Portions of the P-region (spanning some 22 residues) are among the most 
highly conserved sequences within this family of proteins. In fact, as illus- 
trated in the alignment in Figure 14, this region can be identified in all known 
K + -selective channels, including all Kv channels (which we have discussed 
here), calcium-activated K + channels of fly and mouse (Atkinson et al., 1991 ; 
Butler et aL, 1993), inward rectifier K* channels in plants and mammals 
(Anderson et al., 1992; Sentenac et al., 1992; Ho et al., 1993; Kubo et aL, 
1993a,b) including the G-protein-coupled rat G1RK channel (Kubo et al., 
1993b), the fly eag K + channel (Warmke et al., 1991, 1993), and a putative 
K + channel isolated from E. coli (Milkman, 1994; also see Table 1). 

Shaker NSFFKSI PDAFWUAWTMTTVGYGDHTPVGFUGICI V 

rKvl.KRCKD E-H-S = Y--TIG 

Shab 0=K-V---= G= =C-TTAL--== 

rKv2.1<DRKl> D=< AS T= =Y-KTLL 

Shaw HND-N---LGL---=" A-KT=l-KF- 

rKv3.1<KV4) H=H--H~IG = Y-QT=S-M=- 

Shat A-K-T---A---=T= = V-KTIA F 

mKv4.1(m$hal) K=N-T — A — =T= = V-STIA F 

rKv5.1(IK8) E=l QS == =Y-KTTL--=N 

TKV6.UK13) SPE-T---AC= = V-RSTP-Q=- 

FslP fWwa vtMTTvGYGDm P G Kv consensus 

ECOKCH -PRlE-=MT--=S=E--= =V-SESA*==F 

sto -AHRL-YWTCV==l=— = =YCETVL-=TF 

mSlo - NQAL=YW=CV=L L=---= =YAKTTL - == F 

KAT1 A - L=NRYVT -L=-S=T-=--T FHAENPREM=F 

AKT1 £-L=MRYVTSM=-S=T-= =H--NTKEM-F 

R0MK1 VENING=TS--L=S=E-QV-=---FRFVTEGCATA= 

IRK1 V-EVN-FTA--L=S=E-Q--=---FRCVTDECP!A- 

G1RK1 VANVYNF-S-L=F=E-EA-=---YRYlTOKCPEG= 

eag P-RKSMYVT-L==T=TC- -=--=-N=AAETDNE-=F . 

tmttvGyGd all K consensus 

FIGURE 1 4. Amino acid sequence alignment of the putative pore region of K + 
channel proteins (see text). The symbol indicates identity with the topmost 
sequence, while "=" indicates a conservative substitution. The consensus 
sequences for either the Kv family alone (below rKvl.6), or all K + channels 
(bottom), represent positions that either are completely conserved (upper case) 
or at which a single amino acid predominates (lower case). 
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The region aligned in Figure 14 contains 13 residues that are identical in 
all Kv channels, and an additional 5 that show only conservative changes 
within this group. Among the larger group of all K + -selective channels, only 
two residues are perfectly conserved (G[Y/F]G), and an additional three 
positions show exclusively conservative changes. In fact, the eag channel is 
the only known K 4 -selective channel that lacks the tyrosine in the otherwise 
universally conserved GYG motif, replacing it with the very similar pheny- 
lalanine; the recent demonstration that this channel is also permeable to Ca 2+ 
(Bruggemann et a!., 1993) could be relevant in this regard, although the Ca 2+ 
permeability of other K + channels has not been systematically examined. It 
should be noted that Isk/Mink channel lacks any sequence motifs resembling 
the K + channel pore; however, a recent report by Attali et al. (1993) suggests 
that Isk may not be a channel at all, but rather an activator of endogenous K + 
or Cl~ channels present in Xenopus oocytes. 

The presence of this pore motif in both eubacteria and vertebrates sug- 
gests that its evolution predated the early branching of the "tree of life" more 
than three billion years ago. The fact that other cation channels (e.g., Na + and 
Ca 2+ ) do not share this motif suggests that the P-region plays a key role in 
defining the K+ selectivity of these channels, and that acquisition of K + 
selectivity was an ancient evolutionary development. 

The loop linking S5 to the P-region is thought to form part of the outer 
vestibule of the ion conduction pathway (Durell and Guy, 1992), since 
residues in this region have been reported to be involved in toxin binding 
(MacKinnon et al, 1989, 1990; Hurst et al., 1991; Goldstein et al., 1992). 
This loop is more highly conserved in its length than in its sequence, being 
13-15 amino acids long in all but the mammalian Shaw homologues which 
have 22 residues (Figure 12). 
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2.1.3.3 The S4 Segment and Leucine Zipper 

The S4 segment of K + channels is thought to form a major element of the 
voltage sensor, and mutagenesis of its positive charges (and some hydrophobic 
residues) has been shown to alter the gating properties of Shaker (Papazian et 
al., 1991; Liman et al., 1991; Koren et al., 1990; Logothetis et al., 1992, 1993; 
Lopez et al., 1991; also see review by Jan and Jan, 1992). The S4 motif with 
its characteristic pattern of positive charges every third residue is also present 
in Ca 2+ and Na + channels (Catterall, 1988), supporting its importance for 
voltage sensing. The presence of a similar motif in cGMP-gated channels seems 
to be at variance with their ligand-gated properties, but may account for their 
moderate voltage dependence (Kaupp et al., 1989; Ludwig et al., 1990). 

The S4 segment, as indicated in Figure 12, is generally assumed to be 
about 20 amino acids in length, consistent with the presumed length of a- 
helix required to span the cell membrane. Given this disposition, the S4 
segments of the S/idfcer-subfamily channels contain seven conserved positive 
charges (five arginines, two lysines) in the repeated motif, while the Shaw- 
subfamily proteins have six, and S/iafc-subfamily channels as well as Kv5.1 
(IK8) and Kv6.1 (K13) each has five. Alignment of the S/w/-subfamily 



36 



HANDBOOK OF RECEPTORS AND CHANNELS 



channels with the other Kv channels reveals a three amino acid deletion in the 
S4 segment, which eliminates the second arginine in the repeated motif, and 
in place of the last lysine they contain a glutamine, leaving them with only 
five positive charges. However, the Shal proteins contain an additional down- 
stream arginine in phase with the repeated motif (at a position at which 
Shaker and Shab channels have a glutamine), and inclusion of this residue in 
the S4 segment would bring the total number of positive charges to six. If the 
S4 segment is of the same length in these different channels, then this sixth 
positively charged residue would be included in the voltage sensor of the 
mammalian 5/ia/-subfamily proteins, as well as in the fly Shaw protein. Nor 
has the possibility been excluded that the arginine at this position in the 
mammalian Shaw proteins also participates in voltage sensing. 

Voltage sensing in these channels certainly depends on structural fea- 
tures outside the S4 region itself. While specific residues outside S4 influenc- 
ing this process have not yet been identified, charged amino acids at highly 
conserved positions may be attractive candidates. It is tempting, for example, 
to suggest that the conserved arginine present in the S2 segment of all Kv 
protein may influence the voltage dependence of activation of these channels. 
Several negatively charged residues are also conserved in all Kv channels (the 
FFDR motif in the N-terminal region already noted, a glutamic acid in S2, an 
aspartic acid in S3, and a glutamic acid at each end of S5); these and other 
charged residues could influence voltage sensing in the various Kv isoforms. 

A leucine heptad motif is repeated five times at the end of the S4 segment 
and S4/S5 loop of all the Shaker- and Shab- subfamily channels (Figure 12), 
forming a "leucine zipper" motif thought to be important in many protein- 
protein interactions (McCormack et al., 1989; Alber, 1992). Leucines 2 and 
5 (L2 and L5) are present in all Kv channels. LI is also found in all the 
channels, with the exception of the StaZ-subfamily channels," which have a 
phenylalanine at that position. Similarly, phenylalanine replaces L4 in the 
Shaw channels and Kv6.1, while phenylalanine and alanine are present in 
Shaw and Kv6.1 in the place of L3. Substitutions of these leucines in the 
Shaker channel has been shown to alter the voltage-dependent steps during 
channel gating (McCormack et al., 1991; Schopa et al., 1992). While the 
significance of this leucine zipper-like structure is still not understood, it 
clearly can influence channel gating, perhaps through modification of 
interdomain interactions. 
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2.1.3.4 Kv-Channel Phylogenetic Tree 

Figure 15 shows a proposed phylogenetic tree for the extended Kv family. 
This topology was determined by first examining trees including only mem- 
bers of the Kvl, Kv2, Kv3, and Kv4 subfamilies, and constraining the 
relationships within the Shaker and Shaw groups to be those shown in the 
trees in Figures 4 and 10; the most parsimonious tree found in this way makes 
neighbors of the Shaker and Shal groups on the one hand, and the Shaw and 
Shab groups on the other, as in our earlier analyses (Strong et al., 1993). 
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FIGURE 15. Proposed phylogenetic tree of voltage-gated K + channel genes, 
based on parsimony analysis of nucleotide sequence alignments using the 
program PAUP (Swofford, 1993; see text). 

Adding rKv5.1 (IK8) and rKv6.1 (K13) to such a tree places the former on 
the branch leading to Shab, and the latter on that leading to Shal, as shown 
in the figure. However, neither the deep branching between the Kvl and Kv4 
subfamilies in this tree (Shak/Shal versus Shab/Shaw), nor the placement on 
this tree of Kv5. 1 and Kv 1 .6 is compellingly supported by bootstrapping. The 
deep divisions within this group of genes, together with their as yet unre- 
solved rooting (Strong et ah, 1993), may therefore have to await the availabil- 
ity of additional sequence data, or more powerful analytic methods, for their 
satisfactory resolution. 
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Table 2b. Dist 



Gene 


Kvl.l 
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CA1], 
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Kvl.3 
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striatu 
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01 facte 
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Cerebe 
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Kv2.1 


Cerebri 


Kv2.2 
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Cerebe 




cortic; 




while 




embr> 


Kv3.2 
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Kv3.3 
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Table 2b. Distribution of Voltage-Gated Potassium Channel Genes in Mammalian Brain 



Gene 



Comments 



Kvl 1 Pons/medulla'>rnid brain.' superior and inferior colliculus,' cerebellum' >hippocampus 

[CA3>dentate gyrus>CAl],'- 2 thalamus ? cerebral cortex'- 1 corpus striatum.' olfactory bulb 
Kv I 2 Pons/medulla' > cerebellum,'- 2 inferior colliculus' >hippocampus tCA3>dentate gyrus = 
CAl),'- 2 thalamus, 2 cerebral cortex.'- 2 superior coiliculus' >mid brain,' corpus stnatum. 
olfactory bulb' , 
Kvl.3 Inferior colliculus' >olfactory bulb,' pons/medulla' >mid brain,' supenor colliculus.' corpus 

striatum,' hippocampus,' cerebral cortex' ..... 
Kv 1 4 Olfactory bulb,' corpus striatum,' > hippocampus,'-" superior and inferior colliculus 

>cerebral cortex,' mid brain,' basal ganglia' 7 > pons/medulla' 
Kvl.5 Cerebellum [purkinje and granular cells). 22 hypothalamus 24 
Kv 1 6 Medulla/pons 25 > inferior colliculus 25 > corpus striatum 25 
Kv2 1 Cerebral cortex" > hippocampus 22 > cerebellum 22 > olfactory bulb 22 
Kv2.2 Olfactory bulb [granule cell layer>ol factory tubercle] 2 ' >cortex 2 < >hippocampus [CA1-CA4 

region, dentate gyrus] 2 * >cerebellum[granule cell layer] 26 
Kv3 1 Cerebellum" >globus pallidus. 27 subthalamus. 27 substantia nigra" > reticular thalamic nuclei 
cortical and hippocampal interneurons 27 inferior collicuh » cochlear and vestibular nuclei- 
while Kv3 la is expressed mainly in the adult brain Kv3.1b is the major transcript in 
embryonic and perinatal neurons; their distribution within the brain is identical 
Kv3.2 Thalamus 1 ' neocortex [layers 4-6]." piriform nucleus." red nucleus," hippocampus [CA3] 
Kv3.3 Cerebellum [purkinje cells]"- 33 
Kv3.4 Brain 54 

Kv4.2 Cerebellum (granular cells}' 7 >hippocampus[granule cells. CA3/CA1 pyramidal cells],' 7 

thalamus, 17 medial habenular nucleus' 7 >cerebral cortex' 7 
Kv5. 1 Brain 32 

Kv6.l Brain 22 
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Table 3. (Continued) ' 

' TEA, tctraemylommomurn; CTX, chary bdo tox in; bi'X, dendroloxtn; MCbP, mast ceil flcgranutanng peptide; 4-AP, 4-am.nopyr.dtnc. Keponed 
values axe rounded to two significant figures, and comparable values differing by less than 10% are shown as identical. 
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